AIbase

# Large model inference optimization

Llama 4 Scout 17b 16e It Gguf
Other
An image-text-to-text model built on the Meta Llama base model; it can be run through gguf-connector and llama-cpp-python.
Image-to-Text
chatpig
258
0
Llama 3.1 70B Instruct GGUF
An ultra-low-bit (1-2 bit) quantization of Llama-3.1-70B that uses IQ-DynamicGate adaptive-precision quantization to improve accuracy while preserving memory efficiency.
Large Language Model, Supports Multiple Languages
Mungert
19.52k
3
Featherless Ai.qwerky QwQ 32B GGUF
Qwerky-QwQ-32B is a large language model with 32B parameters, specializing in text generation tasks.
Large Language Model
DevQuasar
578
2
Sky T1 32B Preview GGUF
Sky-T1-32B-Preview is a 32B-parameter large language model, quantized with llama.cpp using an importance matrix (imatrix) and suitable for text generation tasks.
Large Language Model, English
bartowski
1,069
81
Mixtral 8x22B V0.1 GGUF
Apache-2.0
A quantized version of Mixtral-8x22B-v0.1 produced with llama.cpp, supporting multiple languages and available in multiple quantization types.
Large Language Model, Supports Multiple Languages
bartowski
597
12
© 2025 AIbase